Abstract

Motivated by the theory of inhomogeneous Markov chains, we determine a
sufficient condition for the convergence to 0 of a general product formed
from a sequence of real or complex matrices. When the matrices have a
common invariant subspace $H$, we give a sufficient condition for the
convergence to 0 on $H$ of a general product. Our result is applied to
obtain a condition for the weak ergodicity of an inhomogeneous Markov
chain. We compare various types of contractions which may be defined for a
single matrix, such as paracontraction, $\ell$-contraction, and
$H$-contraction, where $H$ is an invariant subspace of the matrix.
1 Introduction

Recently there has been much interest in conditions for the convergence of
infinite products of real or complex matrices. Several investigations have
concentrated on products taken in one direction, left or right; see for
example the recent papers by Beyn and Elsner [?] and Hartfiel and Rothblum
[?]. However, in this paper, we are concerned with general products formed
from a given infinite sequence of matrices. These are defined in Section 3;
they have previously been considered for nonnegative and for stochastic
matrices by Seneta in [16, Chapters 3.1 and 4.6].

Our principal result is a sufficient condition for the convergence to 0 of
infinite general products of matrices. We pay particular attention to the
case where there is a common invariant subspace for all the matrices in the
product. As a special case, we obtain a result on the weak ergodicity of an
inhomogeneous Markov chain.

The investigations described above are preceded by a study of the inter- 
relations of several types of contractions which may be defined for a single 
matrix. 



The motivation for our study comes from the theory of inhomogeneous
Markov chains, see [15] and [16, Chapter 4] for much further background
material. This theory naturally leads to the study of the convergence,
ergodicity and weak ergodicity of products of stochastic matrices.



We now describe our paper in more detail. First, in Section 2, we sharpen
several observations concerning paracontraction. This we shall achieve by
introducing a partial norm on the matrix restricted to an invariant subspace
$H$ and the consequent notion of $H$-contraction. We relate this concept to
the notion of paracontraction as introduced by Nelson and Neumann in [13]
and to the notion of $\ell$-contraction recently introduced by Beyn and
Elsner in [?].



Second, in Section 3, which is essentially independent of Section 2, we
turn to our main results. We formulate sufficient conditions, depending
on norms, for the convergence to 0 of infinite general products of matrices
which are formed from a given infinite sequence of matrices. We then apply
the results to examine the convergence to 0 of infinite general products on
a common invariant subspace of the matrices.

In Section 4 we specialize our results and we investigate products of
stochastic matrices to obtain a sufficient condition for weak ergodicity.
The $\ell_1$ coefficient of ergodicity due to Bauer, Deutsch, and Stoer [1]
plays a special role here.

In Section 5 we give bounds on the algebraic and geometric multiplicities 
of the eigenvalues of the restriction of a matrix to an invariant subspace in 
terms of these quantities for the entire matrix. These are related to familiar 
results on stochastic matrices. 



2 Partial norms and Paracontractions 

In this paper $\mathbb{F}$ will stand for the real field $\mathbb{R}$ or the
complex field $\mathbb{C}$. We begin by recalling the following definition:



Definition 2.1 (Nelson and Neumann [13]) Let $A \in \mathbb{F}^{n,n}$ and let
$\nu$ be a vector norm on $\mathbb{F}^n$. Then $A$ is paracontracting with
respect to $\nu$ if

$$\nu(Ax) < \nu(x) \quad \text{whenever } Ax \neq x. \tag{1}$$



In particular, one immediate consequence of the definition is that
$\nu^0(A) \le 1$, where $\nu^0(A)$ denotes the operator norm corresponding to
the norm $\nu$. For a given vector norm $\nu$ let $\mathcal{N}_\nu$ denote the
set of all matrices in $\mathbb{F}^{n,n}$ that are paracontracting with
respect to $\nu$. Nelson and Neumann show in [13] that if
$A \in \mathcal{N}_\nu$, then $\lim_{i \to \infty} A^i$ exists and that
$AB \in \mathcal{N}_\nu$ if $A, B \in \mathcal{N}_\nu$.



We now introduce the concept of a partial norm with respect to a sub- 
space and the concept of H-contraction: 



Definition 2.2 Let $H$ be a subspace of $\mathbb{F}^n$ and let
$A \in \mathbb{F}^{n,n}$ be a matrix which leaves $H$ invariant. Let $\nu$ be
a norm on $\mathbb{F}^n$ and define $\nu^0$ to be the operator norm induced
by $\nu$. We set

$$\nu_H^0(A) = \sup_{x \in H,\ \nu(x) = 1} \nu(Ax). \tag{2}$$

We call $\nu_H^0(A)$ a partial norm of $A$ with respect to the invariant
subspace $H$ of $A$.

Definition 2.3 The matrix $A \in \mathbb{F}^{n,n}$ is nonexpansive with
respect to the vector norm $\nu$ on $\mathbb{F}^n$ if $\nu^0(A) \le 1$. The
matrix $A$ is an $H$-contractor if it is nonexpansive, $H$ is an invariant
subspace of $A$, and $\nu_H^0(A) < 1$.

We shall denote the range of $A \in \mathbb{F}^{n,n}$ by $\mathcal{R}(A)$ and
the nullspace of $A$ by $\mathcal{N}(A)$.

Remark 2.4 If $A \in \mathbb{F}^{n,n}$ is nonexpansive with respect to $\nu$,
then the spectral radius $\rho(A)$ of $A$ satisfies
$\rho(A) \le \nu^0(A) \le 1$. If 1 is an eigenvalue of $A$, then we must have
that $\nu^0(A) = \rho(A)$ and it follows that all Jordan blocks of $A$
corresponding to 1 are $1 \times 1$ (viz. $\operatorname{index}_1(A) = 1$),
e.g., Mott and Schneider [12]. Hence $K = \mathcal{N}(I - A)$ and
$H = \mathcal{R}(I - A)$ are complementary invariant subspaces.

We next show the connection between paracontraction and $H$-contraction.

Lemma 2.5 Let $A \in \mathbb{F}^{n,n}$ be paracontracting with respect to
$\nu$ and let $K$ be the subspace of all its fixed points. Then
$H = \mathcal{R}(I - A)$ is invariant under $A$, complementary to $K$, and
such that $\nu_H^0(A) < 1$, so that $A$ is an $H$-contractor.



Proof: It follows by Remark 2.4 that $H$ is invariant under $A$ and
complementary to $K$. Since $A$ is paracontracting with respect to $\nu$, we
have that $\nu^0(A) \le 1$. Since $H \cap K = \{0\}$, we have $Ax \neq x$ for
every nonzero $x \in H$, and in view of (1) it now follows that
$\nu(Ax) < \nu(x)$ for all nonzero $x \in H$. Hence, since the unit ball $U$
of $\nu$ in $H$ is compact,

$$\max_{x \in U} \nu(Ax) < 1.$$

Hence $\nu_H^0(A) < 1$. □
Recently Beyn and Elsner [?] have introduced the notion of
$\ell$-paracontraction:

Definition 2.6 ([?]) A matrix $A \in \mathbb{F}^{n,n}$ is
$\ell$-paracontracting with respect to the vector norm $\nu$ on
$\mathbb{F}^n$ if there exists a $\gamma > 0$ such that

$$\nu(Ax) \le \nu(x) - \gamma\,\nu(Ax - x), \quad \forall x \in \mathbb{F}^n. \tag{3}$$
In their paper Beyn and Elsner establish several conditions which are
equivalent to $\ell$-paracontraction. Clearly $\ell$-paracontraction implies
paracontraction. We next show that $H$-contraction implies
$\ell$-paracontraction for a suitably chosen norm and a suitably chosen
subspace $H$.



Theorem 2.7 Let $A \in \mathbb{F}^{n,n}$ and suppose that $A$ is nonexpansive
with respect to the norm $\nu$ on $\mathbb{F}^n$. Let
$H = \mathcal{R}(I - A)$ and let $K = \mathcal{N}(I - A)$. If the norm $\nu$
satisfies

$$\nu(y + z) = \nu(y) + \nu(z), \quad \text{for } y \in H,\ z \in K, \tag{4}$$

then the following are equivalent:

(i) $A$ is $\ell$-paracontracting with respect to $\nu$.

(ii) $A$ is paracontracting with respect to $\nu$.

(iii) $A$ is an $H$-contractor with respect to $\nu$.
Proof: (i) $\Rightarrow$ (ii): Obvious.

(ii) $\Rightarrow$ (iii): By Lemma 2.5.

(iii) $\Rightarrow$ (i): Since $A$ is nonexpansive, we have by Remark 2.4
that $K \oplus H = \mathbb{F}^n$. For $x \in \mathbb{F}^n$, write

$$x = y + z, \quad y \in H,\ z \in K.$$

Let $k = \nu_H^0(A)$ and note that $k < 1$ by (iii). Since
$x - Ax = y - Ay \in H$ it follows that

$$\nu(x - Ax) = \nu((I - A)y) \le \nu_H^0(I - A)\,\nu(y) \le (1 + k)\,\nu(y). \tag{5}$$

Moreover, using (4), we obtain

$$\nu(x) - \nu(Ax) = \nu(y + z) - \nu(Ay + z) = \nu(y) - \nu(Ay) \ge (1 - k)\,\nu(y). \tag{6}$$

We combine (5) and (6) and we deduce that

$$\nu(x - Ax) \le (1 + k)\,\nu(y) \le \frac{1 + k}{1 - k}\,\bigl(\nu(x) - \nu(Ax)\bigr),$$

and this is equivalent to (3) with $\gamma = (1 - k)/(1 + k)$. □
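As a quick numerical illustration of Theorem 2.7 (a sketch under assumed data, not part of the original argument), take the hypothetical matrix A = diag(1, 1/2) on R^2 with the l1 norm. Then K = N(I - A) and H = R(I - A) are the two coordinate axes, so the additivity condition (4) holds, the partial norm of A on H is k = 1/2, and the proof yields gamma = (1 - k)/(1 + k) = 1/3. The names `nu`, `apply_A` and `samples` below are illustrative choices.

```python
def nu(x):
    """l1 norm on R^2; it is additive across the coordinate axes H and K."""
    return abs(x[0]) + abs(x[1])


def apply_A(x):
    """Hypothetical A = diag(1, 1/2): fixes K = span(e1), contracts H = span(e2)."""
    return (x[0], 0.5 * x[1])


k = 0.5                     # partial norm of A on H for this example
gamma = (1 - k) / (1 + k)   # constant obtained in the proof, here 1/3

samples = [(1.0, 0.0), (0.0, 1.0), (2.0, -3.0), (-0.5, 4.0), (1e-3, 7.0)]
slack = []
for x in samples:
    Ax = apply_A(x)
    diff = (Ax[0] - x[0], Ax[1] - x[1])
    # Inequality (3) holds for x exactly when this slack is nonnegative.
    slack.append(nu(x) - gamma * nu(diff) - nu(Ax))

print(all(s >= -1e-12 for s in slack))
```

For this particular matrix the slack works out to |x_2|/3, so the inequality is tight exactly on the fixed points of A. A finite sample can only illustrate the theorem, it does not prove it.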
Remark 2.8 If $\nu'$ is a norm on $\mathbb{F}^n$, then we may define a norm
$\nu$ that satisfies (4) and agrees with $\nu'$ on $H$ and $K$ by setting
$\nu(y + z) = \nu'(y) + \nu'(z)$ for $y \in H$ and $z \in K$.



Remark 2.9 The implication (iii) $\Rightarrow$ (ii) of Theorem 2.7 holds
under weaker assumptions on the norm $\nu$. Specifically, if $A$ is an
$H$-contractor with respect to $\nu$ and the condition

$$\nu(y) < \nu(y') \ \Longrightarrow\ \nu(y + z) < \nu(y' + z), \quad y, y' \in H,\ z \in K,$$

is satisfied, then $A$ can be shown to be paracontracting with respect to
$\nu$.



3 Convergence of Infinite Products 



In this section we develop our main results concerning the convergence of
products of complex matrices taken in an arbitrary order from an infinite
sequence of matrices. Such products were considered (in a slightly less
general form) in Seneta [16, Section 4.6] in the case of stochastic
matrices, see also [?] and [?].



Let $A_1, A_2, \ldots$ be a sequence of complex matrices. We shall consider
products of matrices obtained from the sequence in the following manner:
First choose some permutation of the given infinite sequence to obtain a
sequence $B_1, B_2, \ldots$. Then form the products $C_{p,r}$ of the matrices
$B_{p+1}, \ldots, B_{p+r}$ in some order. We shall call $C_{p,r}$ a general
product from the sequence $A_1, A_2, \ldots$ and we shall consider the
existence of $\lim_{r \to \infty} C_{p,r}$. If this limit is 0 for all
permutations of $A_1, A_2, \ldots$ and all $p$, $p \ge 0$, then we shall say
that all general products from the sequence $A_1, A_2, \ldots$ converge to 0.

As an example of a sequence of general products, suppose the chosen order
is $A_9, A_7, A_5, A_{14}, A_2, \ldots$. Then, for $p = 1$, the sequence
$(C_{1,1}, C_{1,2}, \ldots)$ may begin thus:

$$C_{1,1} = A_7,$$
$$C_{1,2} = A_7 A_5,$$
$$C_{1,3} = A_5 A_{14} A_7,$$
$$C_{1,4} = A_2 A_7 A_{14} A_5.$$
Note that, for a given sequence $C_{p,1}, C_{p,2}, \ldots$ of general products,
each factor of $C_{p,r}$ occurs in $C_{p,r+1}$, but the order in which the
factors occur in $C_{p,r}$ is arbitrary.
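The construction of general products can be sketched in a few lines of Python. This is only an illustration under stated assumptions: matrices are represented by their indices, the helper name `general_product_factors` is hypothetical, and a seeded shuffle stands in for "some order". With the permuted order A_9, A_7, A_5, A_14, A_2 and p = 1, the factors available to C_{1,r} are B_2, ..., B_{r+1}.

```python
import random


def general_product_factors(order, p, r, rng):
    """Return the indices of B_{p+1}, ..., B_{p+r} in an arbitrary order.

    `order` lists the indices of the permuted sequence, i.e. order[k - 1]
    is the index i with B_k = A_i.
    """
    factors = list(order[p:p + r])
    rng.shuffle(factors)          # any ordering of the factors is allowed
    return factors


order = [9, 7, 5, 14, 2]           # B_1 = A_9, B_2 = A_7, B_3 = A_5, ...
rng = random.Random(0)             # fixed seed only for reproducibility
products = [general_product_factors(order, 1, r, rng) for r in range(1, 5)]

for r, factors in enumerate(products, start=1):
    print(f"C_(1,{r}) =", " ".join(f"A_{i}" for i in factors))
```

Each list of factors contains the previous one as a set, while the order inside each product is unrelated to the order in the previous product, which is the point of the definition.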

Let $\mu$ be a matrix norm (viz. a submultiplicative norm on
$\mathbb{F}^{n,n}$) and denote

$$\mu^+(P) = \max(\mu(P), 1) \quad \text{and} \quad \mu^-(P) = \min(\mu(P), 1).$$

Now let $A_1, A_2, \ldots$ be a sequence of matrices in $\mathbb{F}^{n,n}$ and
let $\mu$ be a matrix norm. We now define two conditions:



Condition (C): We say that the sequence $A_1, A_2, \ldots$ satisfies
Condition (C) for the norm $\mu$ if

$$\sum_{i=1}^{\infty} \bigl(\mu^+(A_i) - 1\bigr) \ \text{converges.} \tag{7}$$

Condition (D): We say that the sequence $A_1, A_2, \ldots$ satisfies
Condition (D) for the norm $\mu$ if

$$\sum_{i=1}^{\infty} \bigl(1 - \mu^-(A_i)\bigr) \ \text{diverges.} \tag{8}$$



We are now ready to prove the following result: 



Proposition 3.1 Let $A_1, A_2, \ldots$ be a sequence of matrices in
$\mathbb{F}^{n,n}$. Let $\mu$ be a matrix norm on $\mathbb{F}^{n,n}$. Suppose
that the sequence $A_1, A_2, \ldots$ satisfies Condition (C) for the norm
$\mu$. Then all general products from $A_1, A_2, \ldots$ are bounded.

Proof: Let $B_1, B_2, \ldots$ be a permutation of $A_1, A_2, \ldots$ and let
$C_{p,r}$ be a product of $B_{p+1}, \ldots, B_{p+r}$ in some order. By
Condition (C) and [?, Theorem 14], $\sum_{i=1}^{\infty} (\mu^+(B_i) - 1)$
converges and hence $\sum_{i=1}^{\infty} (\mu^+(B_{p+i}) - 1)$ also converges.
Thus, by [?, Theorem 51], the product $\prod_{i=1}^{\infty} \mu^+(B_{p+i})$
converges and so there exists a positive constant $M$ such that
$\prod_{i=1}^{r} \mu^+(B_{p+i}) \le M$ for each $r \in \{1, 2, \ldots\}$.
Since $\mu(P) = \mu^+(P)\,\mu^-(P)$ for every matrix $P$, it follows that

$$
\begin{aligned}
\mu(C_{p,r}) &\le \mu(B_{p+1}) \cdots \mu(B_{p+r}) \\
&= \Bigl[\prod_{i=1}^{r} \mu^+(B_{p+i})\Bigr] \Bigl[\prod_{i=1}^{r} \mu^-(B_{p+i})\Bigr] \\
&\le M \prod_{i=1}^{r} \mu^-(B_{p+i}) \\
&\le M.
\end{aligned}
\tag{9}
$$

□

The above proposition allows us to prove a stronger result under an
additional condition. Note that in the theory of infinite products of
nonnegative numbers it is customary to speak of divergence to 0, see e.g.
[?, p. 93].



Theorem 3.2 Let $A_1, A_2, \ldots$ be a sequence of matrices in
$\mathbb{F}^{n,n}$. Let $\mu$ be a matrix norm on $\mathbb{F}^{n,n}$. Suppose
that the sequence $A_1, A_2, \ldots$ satisfies Conditions (C) and (D) for the
norm $\mu$. Then all general products from $A_1, A_2, \ldots$ converge to 0.

Proof: Let $B_1, B_2, \ldots$ be a permutation of $A_1, A_2, \ldots$ and let
$C_{p,r}$ be a product of $B_{p+1}, \ldots, B_{p+r}$ in some order. As in the
proof of Proposition 3.1, we have that

$$\mu(C_{p,r}) \le M \prod_{i=1}^{r} \mu^-(B_{p+i}). \tag{10}$$

By Condition (D), the sum $\sum_{i=1}^{\infty} (1 - \mu^-(A_i))$ diverges and
so, by [?, Theorem 14], $\sum_{i=1}^{\infty} (1 - \mu^-(B_i))$ diverges. Thus
$\sum_{i=1}^{\infty} (1 - \mu^-(B_{p+i}))$ also diverges.

We again apply [?, Theorem 51], and we obtain that
$\prod_{i=1}^{\infty} \mu^-(B_{p+i})$ diverges. But since
$\mu^-(B_i) \le 1$, the last product must diverge to 0 and the proof is
done. □
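A small numerical experiment can make Conditions (C) and (D) concrete. The sketch below is illustrative only: the test sequence `make_A`, the choice of the maximum row sum norm, and the use of the identity permutation B_i = A_i with a seeded shuffle are all assumptions made for this example. The matrices have norm 1 + 1/i^2 for even i and 1 - 1/(i+1) for odd i, so the sum in (7) stays bounded while the sum in (8) grows like a harmonic series.

```python
import random


def mat_mul(X, Y):
    """Multiply two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def norm_inf(X):
    """Maximum absolute row sum, a submultiplicative matrix norm."""
    return max(sum(abs(v) for v in row) for row in X)


def make_A(i):
    """Hypothetical test sequence: a scaled 2x2 permutation matrix of norm c."""
    c = 1 + 1 / i**2 if i % 2 == 0 else 1 - 1 / (i + 1)
    return [[0.0, c], [c, 0.0]] if i % 3 == 0 else [[c, 0.0], [0.0, c]]


def general_product(p, r, seed=0):
    """Product of A_{p+1}, ..., A_{p+r} multiplied in a shuffled order."""
    factors = [make_A(i) for i in range(p + 1, p + r + 1)]
    random.Random(seed).shuffle(factors)
    C = [[1.0, 0.0], [0.0, 1.0]]
    for F in factors:
        C = mat_mul(C, F)
    return C


N = 4000
sum_C = sum(max(norm_inf(make_A(i)), 1) - 1 for i in range(1, N + 1))
sum_D = sum(1 - min(norm_inf(make_A(i)), 1) for i in range(1, N + 1))
product_norm = norm_inf(general_product(0, N))

print(sum_C, sum_D, product_norm)
```

In this run the partial sum for (C) remains below 0.42, the partial sum for (D) exceeds 4 and keeps growing with N, and the norm of the shuffled product is already small, in line with Theorem 3.2. The decay is slow here because 1 - mu^-(A_i) only behaves like 1/i.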

If $H$ is a subspace of $\mathbb{F}^n$ which is invariant under
$A \in \mathbb{F}^{n,n}$, we denote the restriction of $A$ to $H$ by $A|_H$.
As an immediate application of Theorem 3.2 we obtain the following result:
Corollary 3.3 Let $A_1, A_2, \ldots$ be a sequence of matrices and let
$C_{p,1}, C_{p,2}, \ldots$ be a sequence of general products from
$A_1, A_2, \ldots$. Let $H$ be a subspace of $\mathbb{F}^n$ which is invariant
under each $A_i$, $i = 1, 2, \ldots$. Let $\nu$ be a norm on $\mathbb{F}^n$.
Suppose that the sequence $((A_i)|_H)$ satisfies Conditions (C) and (D) for
the norm $\nu_H^0$. If $x \in H$, then

$$\lim_{r \to \infty} C_{p,r}\, x = 0.$$

Proof: Immediate by Theorem 3.2. □



Since $\nu_H^0(A) \le \nu^0(A)$ when $H$ is an invariant subspace of $A$, it
follows easily that under the conditions of Corollary 3.3, Condition (C)
(resp. Condition (D)) on the sequence $(A_i)$ implies Condition (C) (resp.
Condition (D)) on the sequence $((A_i)|_H)$.



4 Applications to Stochastic Matrices 

In this section we apply the foregoing results to stochastic matrices. In
order to be consistent with our previous section we consider column
stochastic matrices. Thus "stochastic matrix" will mean "column stochastic
matrix".

Let $e = (1, \ldots, 1)^T \in \mathbb{R}^n$. Throughout this section we shall
assume that

$$H = \{x \in \mathbb{R}^n : e^T x = 0\}. \tag{11}$$

If $A$ is a stochastic matrix in $\mathbb{R}^{n,n}$, then $H$ is invariant
under $A$, but note that $H$ need not be a complement to the space of fixed
points of $A$. If $\nu$ is a norm on $\mathbb{R}^n$ and $H$ is given by (11),
then the corresponding partial norm $\nu_H^0$ on $\mathbb{R}^{n,n}$ will be
called a coefficient of ergodicity, as is usual in the literature on Markov
chains.



The $\ell_1$ norm on $\mathbb{R}^n$ plays a special role in this theory and
we shall denote it henceforth by $\omega$. The corresponding coefficient of
ergodicity was apparently first computed in Bauer, Deutsch, and Stoer [1],
see also [17], and, for a column stochastic matrix $A = (a_{j,i})$, equals

$$\omega_H^0(A) = \frac{1}{2} \max_{i,k \in \{1,\ldots,n\}} \sum_{j=1}^{n} |a_{j,i} - a_{j,k}|.$$

It is known that $\omega_H^0(A) \le 1$ for all stochastic matrices $A$ and
that $\omega_H^0$ is the only coefficient of ergodicity that satisfies this
inequality, see [?] and [?], but see also [?].
Remark 4.1 We comment that the only stochastic matrix $A$ which is
paracontracting with respect to the $\ell_1$ norm $\omega$ is the identity
matrix. This follows easily from the fact that for any nonnegative vector
$u$ with $\omega(u) = 1$, we also have that $\omega(Au) = 1$. On the other
hand, every stochastic matrix that has a row with two or more positive
elements is $H$-contracting for the subspace $H$ given in (11) and the norm
$\omega$.
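The l1 coefficient of ergodicity can be evaluated directly from the definition of the partial norm. The sketch below rests on one fact that is not spelled out in the text: for H as in (11), the unit ball of the l1 norm in H has the vectors (e_i - e_k)/2 as its vertices, so for a column stochastic matrix the supremum is half the largest l1 distance between two columns. The function name `ergodicity_coefficient_l1` and the 3 x 3 matrix are hypothetical choices for illustration.

```python
def ergodicity_coefficient_l1(A):
    """Half the largest l1 distance between two columns of A.

    A is a square matrix given as a list of rows; for a column stochastic
    matrix this is the partial l1 norm on H = {x : sum(x) = 0}.
    """
    n = len(A)
    return max(
        0.5 * sum(abs(A[j][i] - A[j][k]) for j in range(n))
        for i in range(n)
        for k in range(n)
    )


# Hypothetical column stochastic matrix: every column sums to 1.
A = [[0.5, 0.2, 0.0],
     [0.5, 0.3, 0.4],
     [0.0, 0.5, 0.6]]

coef = ergodicity_coefficient_l1(A)

# Cross-check on one vertex of the unit ball in H: x = (e_1 - e_3) / 2.
x = [0.5, 0.0, -0.5]
Ax = [sum(A[i][j] * x[j] for j in range(3)) for i in range(3)]
l1_of_Ax = sum(abs(v) for v in Ax)

print(coef, l1_of_Ax)
```

For this matrix the maximum is attained by the first and third columns, and the cross-check vector reproduces the same value. The identity matrix gives the value 1, which is consistent with it not being an H-contractor.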



Definition 4.2 Let $P_1, P_2, \ldots$ be a sequence of $n \times n$ stochastic
matrices. We shall say that all general products formed from this sequence
are weakly ergodic if for all general products $B_{p,1}, B_{p,2}, \ldots$, we
have that

$$\lim_{r \to \infty} B_{p,r}\, x = 0, \quad \text{for all } x \in H. \tag{12}$$



Since every $x \in H$ can be written $x = c(u - v)$, where
$u, v \in \mathbb{R}^n$ are nonnegative, $e^T u = e^T v = 1$ and
$c \in \mathbb{R}$, it is easily seen that, for each product considered, our
definition is equivalent to that in [?] or [?, Defn. 3.3].

By Theorem 3.2 we now immediately obtain:



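Theorem 4.3 Let $\nu$ be a norm on $\mathbb{R}^n$ and let $\nu_H^0$ be the
corresponding coefficient of ergodicity. Let $P_1, P_2, \ldots$ be a sequence
of $n \times n$ stochastic matrices. Then all general products formed from
this sequence are weakly ergodic if

$$\sum_{i=1}^{\infty} \bigl((\nu_H^0)^+(P_i) - 1\bigr) \ \text{converges} \tag{13}$$

and

$$\sum_{j=1}^{\infty} \bigl(1 - (\nu_H^0)^-(P_{i_j})\bigr) \ \text{diverges for some subsequence } P_{i_1}, P_{i_2}, \ldots \text{ of } P_1, P_2, \ldots. \tag{14}$$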
Note that (13) is automatically satisfied if $\nu = \omega$, the
$\ell_1$-norm. This special case of Theorem 4.3 is observed in
[16, Exercise 4.36]. Results related to this case (and which therefore
involve only (14) explicitly) are to be found in [15, Theorem 1] and in
[11]. The theorem in the latter paper is there illustrated by an example of
a sequence of stochastic matrices that satisfies (14) for the norm
$\omega$, see [11, p. 333].



The following corollary is due to Rhodius [14, Thm. 3, Part I] in the case
of $\nu = \omega$, see Leizarowitz [?, Thm A (i)] for a related result.

Corollary 4.4 Let $P_1, P_2, \ldots$ be a sequence of stochastic matrices. If
$\nu_H^0(P_i) \le 1$ for all $i$, $i = 1, 2, \ldots$, and there exists a point
of accumulation $c$ of the sequence $\nu_H^0(P_1), \nu_H^0(P_2), \ldots$ such
that $c < 1$, then all general products of the sequence are weakly ergodic.

Proof: Clearly condition (13) holds and there is an infinite subsequence
$i_1, i_2, \ldots$ such that $\nu_H^0(P_{i_j}) < (1 + c)/2 < 1$,
$j = 1, 2, \ldots$. Then condition (14) holds for this subsequence. The
result follows from Theorem 4.3. □



Another corollary of Theorem 4.3 is [?, Thm A (ii)]:



Corollary 4.5 Let $P_1, P_2, \ldots$ be a sequence of stochastic matrices and
let $\nu$ be a norm on $\mathbb{R}^n$. If all points of accumulation $c$ of
the sequence $\nu_H^0(P_1), \nu_H^0(P_2), \ldots$ satisfy $c < 1$, then all
general products of the sequence are weakly ergodic.

Proof: Since the set of accumulation points of a bounded sequence is
compact, there exists $d < 1$ such that only a finite number of terms of the
sequence $\nu_H^0(P_1), \nu_H^0(P_2), \ldots$ exceed $d$. Hence (13) and (14)
hold for the sequence of ergodicity coefficients and the corollary follows
from Theorem 4.3. □
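Weak ergodicity can also hold when the ergodicity coefficients tend to 1, a case not covered by the two corollaries but still within reach of Theorem 4.3. The following sketch uses a hypothetical family `make_P` of 2 x 2 column stochastic matrices whose l1 coefficient of ergodicity is 1 - 1/(i+1); the matrices approach the identity, yet the sum in (14) is a divergent harmonic series. The shuffle with a fixed seed is just one arbitrary ordering of the factors.

```python
import random


def mat_vec(P, x):
    """Apply the matrix P (list of rows) to the vector x."""
    return [sum(P[i][j] * x[j] for j in range(len(x))) for i in range(len(P))]


def make_P(i):
    """Hypothetical column stochastic matrix with l1 coefficient 1 - 1/(i+1)."""
    a = 0.5 / (i + 1)
    return [[1 - a, a], [a, 1 - a]]


N = 999
factors = [make_P(i) for i in range(1, N + 1)]
random.Random(1).shuffle(factors)      # an arbitrary order of the factors

x = [1.0, -1.0]                        # x = u - v with u = e_1, v = e_2, so x is in H
for P in factors:
    x = mat_vec(P, x)

l1_norm = sum(abs(v) for v in x)
print(l1_norm)
```

Each factor scales x by exactly 1 - 1/(i+1), so the product of the 999 scalings telescopes to 1/1000 regardless of the order and the final l1 norm is about 0.002. The two starting distributions e_1 and e_2 are therefore nearly indistinguishable after these steps.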



5 Bounds for eigenvalues 

Theorem 5.1 Let $H$ be an invariant subspace for $A \in \mathbb{C}^{n,n}$.
Let $\alpha_\lambda(A)$ and $\gamma_\lambda(A)$ be the algebraic and geometric
multiplicities of $\lambda \in \mathbb{C}$ as an eigenvalue of $A$,
respectively. Then

$$\alpha_\lambda(A) \le \alpha_\lambda(A|_H) + n - \dim(H)$$

and

$$\gamma_\lambda(A) \le \gamma_\lambda(A|_H) + n - \dim(H).$$

Further, if $\mu$ is a vector norm on $\mathbb{C}^n$ and
$|\lambda| > \mu_H^0(A)$, then $\alpha_\lambda(A) \le n - \dim(H)$ and
$\gamma_\lambda(A) \le n - \dim(H)$.

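Proof: By a proper choice of basis for $\mathbb{C}^n$ we may put $A$ in the
form

$$\begin{pmatrix} A_{1,1} & 0 \\ A_{2,1} & A_{2,2} \end{pmatrix}, \tag{15}$$

where $A_{2,2}$ represents $A|_H$ and $A_{1,1}$ is of order $n - \dim(H)$,
and the first inequality follows immediately.

The second inequality is an immediate consequence of [10, Theorem 2.1],
see also [?, Cor. 5.8].

Finally, since all eigenvalues $\lambda$ of $A_{2,2}$ satisfy
$|\lambda| \le \mu_H^0(A)$, the last part of the theorem follows. □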



Remark 5.2 Theorem 5.1 implies several known results. We now give an
example which could also be derived from [?, Theorem 2]. Let $A$ be a
stochastic matrix, let $H$ be defined by (11), and suppose that a coefficient
of ergodicity satisfies $\nu_H^0(A) < 1$. Since, in this case, the matrix
$A_{1,1}$ in (15) is $1 \times 1$, it follows that 1 is an algebraically
simple eigenvalue of $A$ and all other eigenvalues $\lambda$ of $A$ satisfy
$\nu_H^0(A) \ge |\lambda|$.
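The statement of Remark 5.2 can be checked by hand on a 2 x 2 case. The sketch below uses a hypothetical column stochastic matrix and the l1 coefficient of ergodicity, computed as half the l1 distance between the two columns; the eigenvalues come from the trace and determinant, so no external library is assumed.

```python
import cmath

# Hypothetical column stochastic example: each column sums to 1.
A = [[0.9, 0.2],
     [0.1, 0.8]]

# Eigenvalues of a 2x2 matrix from its trace and determinant.
tr = A[0][0] + A[1][1]
det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
root = cmath.sqrt(tr * tr - 4 * det)
eigs = sorted([(tr + root) / 2, (tr - root) / 2], key=abs, reverse=True)

# l1 coefficient of ergodicity: half the l1 distance between the two columns.
coef = 0.5 * (abs(A[0][0] - A[0][1]) + abs(A[1][0] - A[1][1]))

print(eigs, coef)
```

Here the eigenvalues are 1 and 0.7 and the coefficient is 0.7, so 1 is a simple eigenvalue and the bound on the remaining eigenvalue holds with equality.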